PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cagra.4190s0013.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Capsella
Family Trihelix
Protein Properties Length: 301aa    MW: 35405.5 Da    PI: 6.7511
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cagra.4190s0013.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix83.92.1e-2647129186
             trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                          +W+ +e+++Lie+r e+++++ ++k++k lWe +s+km+++ f rsp+qCk+kw+nl +r+k ++++e +    +++++p++d+++
  Cagra.4190s0013.1.p  47 QWSVEETKELIEIRGELDQTFMETKRNKLLWEVISNKMKDKSFPRSPEQCKCKWKNLVTRFKGCETMEAET---ARQQFPFYDDMQ 129
                          7********************************************************************84...56679*****96 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM007177.9E-544106IPR001005SANT/Myb domain
CDDcd122034.20E-2346111No hitNo description
PfamPF138374.0E-2147129No hitNo description
PROSITE profilePS500907.92148104IPR017877Myb-like domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005730Cellular Componentnucleolus
GO:0016592Cellular Componentmediator complex
Sequence ? help Back to Top
Protein Sequence    Length: 301 aa     Download sequence    Send to blast
MDGHQHHHQL HHLQYLNKHH LQHPHPQSQS QTPEIASPVV GGDRFPQWSV EETKELIEIR  60
GELDQTFMET KRNKLLWEVI SNKMKDKSFP RSPEQCKCKW KNLVTRFKGC ETMEAETARQ  120
QFPFYDDMQI IFTTRMQRML WAESEGGGGG GGGGGGGTSG TARKREYSSD EEEENVNEEQ  180
VDVSNDPKIL NPKKNIAKKR KGGSNSINSN NGVREVLEEF MRHQIRMESE WREGWEAREK  240
ERAEKEEEWR RKMEELEKER VTMERMWRDR EEQRRSREEM RAEKRDSLIN ALLAKLTRDD  300
*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
2ebi_A1e-1244125384DNA binding protein GT-1
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1191200PKKNIAKKRK
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAC0030281e-180AC003028.3 Arabidopsis thaliana chromosome 2 clone F16M14 map ve018, complete sequence.
GenBankAF4535821e-180AF453582.1 Arabidopsis thaliana GT-1 like transcription factor (GT1L) mRNA, complete cds.
GenBankCP0026851e-180CP002685.1 Arabidopsis thaliana chromosome 2, complete sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_010509204.11e-149PREDICTED: trihelix transcription factor GT-3b-like
SwissprotO804501e-135TGT3B_ARATH; Trihelix transcription factor GT-3b
TrEMBLD7LLG41e-145D7LLG4_ARALL; Predicted protein
STRINGAl_scaffold_0004_23821e-145(Arabidopsis lyrata)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM22152870
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G38250.14e-96Trihelix family protein